首页> 外文OA文献 >Genre effects on automatic sentence segmentation of speech: A comparison of broadcast news and broadcast conversations
【2h】

Genre effects on automatic sentence segmentation of speech: A comparison of broadcast news and broadcast conversations

机译:类型对语音自动句子分割的影响:广播新闻和广播对话的比较

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We investigate genre effects on the task of automatic sentence segmentation,focusing on two important domains – broadcast news(BN) and broadcast conversation (BC). We employ an HMM modelbased on textual and prosodic information and analyze differencesin segmentation accuracy and feature usage between the two genresusing both manual and automatic speech transcripts. Experimentsare evaluated using Czech broadcast corpora annotated for sentencelikeunits (SUs). Prosodic features capture information about pause,duration, pitch, and energy patterns. Textual knowledge sources includewords, part-of-speech, and automatically induced classes. Wealso analyze effects of using additional textual data that is not annotatedfor SUs. Feature analysis reveals significant differences in bothtextual and prosodic feature usage patterns between the two genres.The analysis is important for building automatic understanding systemswhen limited matched-genre data are available, or for designingeventual genre-independent systems.
机译:我们主要针对两个重要领域-广播新闻(BN)和广播对话(BC),研究类型对自动句子分割任务的影响。我们使用基于文本和韵律信息的HMM模型,并使用手动和自动语音笔录分析两种类型之间的细分准确性和特征使用差异。使用注释了句子单元(SU)的捷克广播语料库对实验进行了评估。韵律特征捕获有关暂停,持续时间,音调和能量模式的信息。文字知识源包括单词,词性和自动归类。我们还将分析使用未标注给SU的其他文本数据的影响。特征分析揭示了两种类型的文本和韵律特征使用模式之间的显着差异。该分析对于在可用有限匹配类型的数据时构建自动理解系统或设计最终独立于类型的系统非常重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号